Discriminative Power And Retrieval Effectiveness Of Phrasal Indexing Terms

نویسنده

  • Sumio Fujita
چکیده

In spite of long controversy, effectiveness of phrasal indexing is not yet clear. Recently, correlation between query length and effect of phrasal indexing is reported. In this paper, terms extracted from the topic set of the NACSIS test collection 1 are analyzed utilizing statistic tools in order to show distribution characteristics of single word/phrasal terms with regard to relevant/nonrelevant documents. Phrasal terms are found to be very good discriminators in general but not all of them are effective as supplemental phrasal terms. A distinction of informative / neutral / destructive phrasal terms is introduced. Retrieval effectiveness is examined utilizing query weight ratio of these three categories of phrasal terms.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Notes on the Limits of CLIR Effectiveness: NTCIR-2 Evaluation Experiments at Justsystem

NTCIR-2 evaluation experiments at the Justsystem site are described with a focus on comparative study of CLIR effectiveness with monolingual retrieval effectiveness of the same retrieval engine. Experiments on the effects of phrasal translation, indexing of translated phrasal terms, pre-translation feedback and parallel documents feedback in diverse retrieval settings, are reported. The results...

متن کامل

Notes on Phrasal Indexing: JSCB Evaluation Experiments at NTCIR AD HOC

The evaluation experiments of the JSCB team are described with a focus on noun phrase indexing and its weighting issues in ad hoc text retrieval. Experiments on the effects of supplemental noun phrase indexing in view of the effect of various length of queries are reported. The results show that the noun phrase indexing outperforms single word only indexing with long queries while single word o...

متن کامل

Multi-facet Document Representation and Retrieval

This paper presents our participation in ImageCLEF2011, in the two tasks: ad-hoc image-based retrieval and case-based retrieval, of the medical retrieval track. We participated through a simple IR model based on three hypotheses: 1) the amount of overlap between a document and a query, 2) the descriptive power of an indexing element, and 3) the discriminative power of an indexing element. We us...

متن کامل

Analysis of the Usage of Japanese Segmented Texts in NTCIR Workshop 2

In this paper, we report on the usage of Japanese segmented texts and analyze the submitted search results to NTCIR Workshop 2, which used these texts. In these texts, each sentence is segmented into terms and term components (similar to phrases and words). However, the sizes of terms are inconsistent in the texts; e.g., some terms that should be decomposed into term components remain as terms....

متن کامل

Automatic suggestion of phrasal-concept queries for literature search

Both general and domain-specific search engines have adopted query suggestion techniques to help users formulate effective queries. In the specific domain of literature search (e.g., finding academic papers), the initial queries are usually based on a draft paper or abstract, rather than short lists of keywords. In this paper, we investigate phrasal-concept query suggestions for literature sear...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000